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Abstract 

In this paper, we study the problem of recovering a sharp version of a given blurry image when the blur kernel 
is unknown. Previous methods often introduce an image-independent regularizer (such as Gaussian or sparse priors) on 
the desired blur kernel. For the first time, this paper shows that the blurry image itself encodes rich information about 
the blur kernel. Such information can be found through analyzing and comparing how the spectrum of an image as a 
convolution operator changes before and after blurring. Our analysis leads to an effective convex regularizer on the blur 
kernel which depends only on the given blurry image. We show that the minimizer of this regularizer guarantees to give 
good approximation to the blur kernel if the original image is sharp enough. By combining this powerful regularizer with 
conventional image deblurring techniques, we show how we could significantly improve the deblurring results through 
simulations and experiments on real images, especially when the blur is large. In addition, our analysis and experiments 
help explaining why edges are good features for image deblurring. 
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I. Introduction 

Figure [TJ shows a very common result of a very blurry image of a car taken by a moving camera - or similar blurring 
effect can be observed in many surveillance photos where the camera is static but the car is moving. In many situations, 
we would like to recover the sharp version of the image so that details of the image (such as the numbers on the license 
plate) become recognizable again. It is in general impossible to correctly deblur the whole image if both the camera 
motion and the scene geometry are both entirely unknown. Nevertheless, if we only consider a small image region (e.g., 
the license plate area shown in Figure [TJ, it is reasonable to assume that the blurry image, denoted as B g R™ lX ™ 2 , 
is approximately generated by the convolution of a sharp image, denoted as Iq, and a common blur kernel, denoted as 
K e K m ^ xm \ £j]: 

BttI ®K , s.t. K eS, 

where (g) denotes the discrete 2D convolution operator, S denotes the simplex of all possible blur kernels (i.e., nonnegative 
and sums to one), ni, n-i are the image sizes, and m\,mi are the sizes of the blur kernel. So, the blind image deblurring 
problem is mathematically formulated as the problem of blind deconvolution JT], 0, @, which is to recover the sharp 
image Iq when the blur kernel Kq is unknown Q. 

The blind decovolution problem has been being investigated for several decades widely in optical society, image 
processing, computer graphics, and computer vision JT], 0, (6), Q, 0, (9). Yet, as recently surveyed in Q, this 
problem is still far from being solved in the more general cases even with the notable progresses made recently (e.g., 

ED, HUJ, E2, ED). 

1 non-blind deconvolution is the restoration of Iq when Ko is given. 





Fig. 1. A scenario of image deblurring. We take a picture of size 1624 X 2448 by using a NEX-5N camera. The picture is blurry due to an 
uncontrolled camera motion. We want to process the picture such that human eyes are able to recognize its details (e.g., the plate number). 
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A straightforward approach for blind deconvolution is to jointly seek the sharp image Iq and the blur kernel Kq by 
minimizing 

minllS - I® K\\%, s.t. KeS, 

I,K 

where || • \\p denotes the Frobenius norm of a matrix. However, this problem is highly ill-conditioned, as it can be perfectly 
minimized by infinite number of pairs (7, K). For example, the no-blur explanation is a perfect solution: K = S (a delta 
function) and I = B. Indeed, even if the blur kernel Kq is given, the non-blind deconvolution problem can still be 
ill-conditioned 0), fl4l . So, in general, it is necessary to regularize the desired solution for the image I: 

mm\\B-I®K\\ F + \f(I), s.t. K e S, (1) 

where A > is a parameter. The regularizer /(/) is usually chosen as the total variation 031 or its variations flU, lfl4l . 
Ifl6l . However, as analyzed in [7 |, such image gradients based regularizer generally favors a blurry solution over a sharp 
one! Actually, whenever /(/) is convex, it is easy to prove that (see the Appendix) 

f(I®K)<f(I), V/eM" 1 '" 2 , VK €S; (2) 

that is, minimizing /(/) will encourage blurring, and thus the no-blur explanation (i.e., I = B and K = S) is always 
favored by minimizing (|T). Hence, it is critical to regularize the kernel K, resulting in the following optimization problem: 

vam\\B-I®K\\ F + \f(I)+ah(K), s.t. KeS, (3) 

where A > 0, a > are two parameters. 

While it is obviously that the regularizer h(K) plays important role in blind deconvolution, the existing proposals 
for h(K) (e.g., the Gaussian prior h(K) = \\K\\ 2 F 0, 07) and the sparse prior h(K) = \\K\\ X UJ), DJD) have no 
significant effect on constraining the blur kernel (TJ. Although such objectives can work well on some simple cases 
where there is no serious blurU, they could not handle more difficult deblurring tasks such as images with a significant 
blur (e.g., Figure [TJ. Moreover, the Gaussian and sparse priors are inaccurate for characterizing real world blur kernel, 
which is usually the combination of a sparse curve-like kernel (which corresponds to the camera motion) and a dense 
Gaussian-like kernel (that models effects such as out of focus). 

In this work, we derive a much more effective regularizer for the blur kernel that can significantly benefit the solution 
of the blind deconvolution problem. This regularizer is based on a simple but important observation about the spectral 
properties of an image as a convolution operator: For a given image (i.e., matrix), consider its convolution with any 
other image. The convolution defines a linear operator. Then empirically, the spectrum (the set of eigenvalues) of this 
linear operator for a blurry image is significantly smaller than that for its sharp counter part (see Figure 2). In fact, 
this can be proven to be true to some extend. Based on this observation, we devise a convex regularizer that tends to be 
minimized at the true blur kernel Kq. Namely, given an observed image B represented by a certain image feature C, we 
deduce a convex function, denoted as h c ( B \K): 

h c( - B \K) :R miXm2 ^R. (4) 

Unlike the previous work (e.g., 0, |[T3l . OjJ, (18|) where the regularizer h(K) is independent of the observed image 
B, our regularizer (K) explicitly depends on the given blurry image and encodes information about how the blurry 
image B is related to the sharp image Iq. It is hence somewhat natural to anticipate that this regularizer h c ^ B \K) would 
depend on the sharp image too. But rather surprisingly, as we will show, under fairly broad conditions, we can come up 
with a very effective regularizer h c ^ B \K) that does not depend on any information about the sharp image Iq at all, and 
the desired kernel Kq can be approximately retrieved by minimizing h c( - B ^{K) directly. 

Equipped with such new regularizer h c ^ B \K), for the blind deconvolution problem, we jointly seek the sharp image 
Iq and the blur kernel Kq by solving the optimization problem (0. Experimental performance of our algorithm is a bit 
surprising: Even when the observed image is blurred to the extent that human eyes cannot recognize its details (e.g., 
Figure [TJ, it is still possible for our algorithm to restore a sharp version with recognizable details. In addition to empirical 
evaluations, theoretical results presented in this paper could also help understanding the blind deconvolution problem: 

- It is known that there are infinite number of ways to decompose a given blurry image B into the convolution of I 
and K Q. Nevertheless, while I (or its edge map) is assumed to be very sharp, we show that such decomposition 
tends to be unique (or approximately so). 

- Although it is widely observed and believed that edges are more important than smooth regions for image deblurring 
iflOll . |[T3l . |[T7ll . 03D, iflSI . there is still lack of good justification. To some extent, our analysis could explain why 
sharp edges are good features for deblurring. 

2 Such simple cases could be solved even without the regularizer h(K). More precisely, by carefully choosing the initial solution and controlling 
the number of iterations, it is possible to obtain satisfactory results based on optimizing (T} 1 191 . 
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eigenvalue index eigenvalue index 



Fig. 2. Example: blurring can significantly decreases the convolution eigenvalues of a sharp image, (a) A sharp image, (b) The blurry image 
created by convoluting the sharp image with a Gaussian kernel, (c) Plots of all (si = S2 = 18) convolution eigenvalues of the sharp and blurry 
images, using C = S. (d) The edges of the sharp image, using LoG. (d) The edges of the blurry image (note that the black area contains negative 
values), (e) Plots of convolution eigenvalues of the sharp and blurry images, choosing C = LoG. 



II. Blind Deconvolution via Spectral Properties of Image Convolution 

In this section, we present the details for designing the regularizer h c ^ B \K). The final algorithm for blind deconvo- 
lution will be given at the end of this section. 

A. Spectral Properties of an Image as a Convolution Operator 

1) Preliminaries: The concept of convolution is well-known. We briefly introduce it for the ease of reading. Let X 
and Y are functions of two discrete variables (i.e., X and Y are matrices), then the formula for the 2D convolution of 
X and Y is 

(X ® Y)(i, j)=J2X(i- u, j - v)Y{u 7 v), 

where (•)(«, j) denotes the (i,j)-th entry of a matrix. The convolution is linear and can be converted into matrix 
multiplication. Let v{-) be the vectorization of a matrix, then 

v{X®Y) = A klM {X)v(Y), VreM felXfe2 , (5) 

where Afc 1] fc 2 (-) is the Toeplitz matrix ll20ll of a matrix, and hi, are parameters. For a matrix X E K ilX ' 2 , its Toeplitz 
matrix A,k 1 ,k 2 (X) is an (^l + k\ — + &2 — l)-by-fciA;2 matrix. 

2) Convolution Eigenvalues and Eigenvectors of Images: In this work, an image / is treated as a matrix associated 
with a certain feature filter C: 

£(/) = C ® I. (6) 

Typically choices for the feature filter C include C = 5 (i.e., using original pixel values as features) or C = "Laplacian 
of Gaussian (LoG)" (i.e., using edge features). 

To formally characterize the procedure of transforming sharp images into blurry images, we define the so-called 
convolution eigenvalues and convolution eigenvectors as follows. 

Definition 2.1: For an image / e K ilXi2 represented by a feature filter C, its first convolution eigenvalue, denoted as 
CT f i 1 ) or viaxi 1 )' is defined by 

<rf(J)= max \\C(I)®X\\ F , s.t. \\X\\ F = 1, 

where si,S2 are called "sampling sizes" in this work. The maximizer to above problem is called the first convolution 
eigenvector, denoted as (I), 

Similarly, the z-th (i = 2, . . . , S1S2) convolution eigenvalue erf (I) is defined by 

of(J)= max ||£(/)®X|| F , s.t. \\X\\ F = 1, {X, K f(I)) = 0,Vj < i, 

where (•, •) denotes the inner production between two matrices. The maximizer to above problem is the z-th convolution 
eigenvector, denoted as nf (I). 

In the following, we summarize some of their properties which will be used later: 

- From (O, it can be seen that the convolution eigenvectors/values are exactly the right singular vectors/values of 
the Toeplitz matrix. So, for an image I with the associated Toeplitz matrix ^4. S1 , S2 (£(J)), its all S1S2 convo- 
lution eigenvalues (and eigenvectors) can be found by computing the Singular Value Decomposition (SVD) of 

(A S1 , S2 (C(I))) T A S1 , S2 (C(I)). 
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- Let (Tmini') denote the smallest (i.e., last) convolution eigenvalue of an image, then 

\\C(I) ®X\\ F > ai in (I) > 0,V\\X\\f = 1. (7) 

- If B — I ® K, K £ S, then (detailed proofs are in the appendix) 

af(B)<af(I),Vi = l,...,sis 2 ; (8) 

that is, the blurring effects generally reduce the convolution eigenvalues of an image. When the original image is 
sharp, the reduction amount can be very significant, as exemplified in Figure [2] In particular, when the edge features 
are used, Figure |2jf) shows that it is even possible to have > a^ lax (B). 

The properties in inequality ^ provide a way to formally define the concept of sharp image, which appears frequently 

in the articles related to deconvolution: An image / is called r-sharp if and only if er^ in (J) > r, where r > is a 

parameter. 

B. A Convex Blur Kernel Regularizer 

In this subsection, we derive a convex regularizer h c ( B \K), which tends to have the minimal value at the desired 
blur kernel Kq, 

1) Derivation: For ease of exploration, we begin with the "ideal" case that the blurry image B is exactly generated 
by the convolution of Iq and Kq, i.e., there is no noise: B = Iq <8> Kq. As convolution operators are associative and 
commutative, the effect of feature extraction is 

C(B) = £(7 ) ® K . (9) 

Here, £ is a predefined feature filter. For the rest of this paper, we consistently choose C = LoG, i.e., we use edge 
features by default (but our analysis applies to any convolution filters). 
By Definition 12.11 we have that 

\\C(B) ® 4{B)\\ F = af(B), Vl<i< Sl s 2) 
where the sampling sizes si, s 2 are taken as parameters. By (0, 

||(£(/ ) ®K a )®Kf(B)\\ F = of(B), VI < i < s lS2 . 
Since the convolution operator is linear, we further have 

K 4(B) _ of(fl) 



C(I Q 



K ®k?(B)\\ f f \\Ko®k?(B)\\ 



Notice that || ^gg*7^ J F = 1. By Q, ||£(7 ) ® X\\ F > ^ n (7 ), V||X|| F = 1. Thus we have the following 
necessary conditions that constrain possible K$: 

\\K ® K f(B)\\ F < VI < i < 8i8 2 . (10) 







Define a regularizing function on the kernel K as 



Then (O implies that 



h^(K )< (11) 

Omin( 7 0)) 2 

Note that crf(B) could be significantly smaller than cr^ in (/o) (see Figure|2]i, and a randomly chosen blur kernel may not 
satisfy ( TTOb or subsequently (fTTl i. Hence, the desired blur kernel Kq should have relatively smaller value for h c ^ B '(K). 
Or we could try to obtain an approximate estimate of the desired blur kernel Kq by minimizing: 

K = £iTgmmh c{B) (K), s.t. KeS, (12) 

where S denotes the (m,\m,i — l)-dimensional simplex, and rax,m 2 are the sizes of the blur kernel. 

It is easy to see that h c ^ B '(K) is a quadratical (hence convex) function, namely h^ B '(K) = (v(K)) T Hv(K) with 
the Hessian matrix H given by 

„ ( Anx ,m a (*f (B))) T A mi , ma («f (5)) 

where A mi ,m 2 { K f(B)) is the Toeplitz matrix of the i-th convolution eigenvector of B. 
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Fig. 3. Demonstrating the effectiveness of the regularizer h^ B >(K), using six synthetical examples. Left: The blur kernel with size 9x9. 
Right: The kernel estimated by solving 4 121 . 



2) Analysis: We now study how effective the regularizer h c ^ B \K) is by evaluating how close its minimizer Kq is 
to the true kernel Kq (proofs to all the theories can be found in the appendix). 

Theorem 2.1: Suppose B = Iq<3) Kq, Kq £ S, and Iq ^ 0. For the kernel Kq estimated by i\2\ . we have that 



\K -K, 



OF 



Mo)' 



The above theorem illustrates that the edges are better than the raw pixels as a feature for the recovery of blur kernel, 
because the edge features can achieve a smaller bound for the estimate error (see Figure |2). This to some extent 
corroborates the previous observations (e.g., ifTol . |[T3l . ifPTl . 1191 . 1 18 1) that the edge feature is a good choice for image 
deblurring. Note that in general cr^ in (/o) can be significantly larger than cr£ oa ,(B). For the example shown in Figure 
|2jf), we have computed the ratio ^ al (5)/(7^„(/o) = 0.15,0 then the above theorem suggests that the estimation 
error is upper bounded by 0.21. Figure [3] shows some more simulated examples of blur kernels directly estimated from 
a blurred Lena image, compared with the true kernels. Notice, that these kernels are obtained without any knowledge 
about the original image I at all! This puts a strongly correct prior on the desired kernel entirely based on the given 
blurry image. These simulated results clearly demonstrate the effectiveness of the proposed regularizer h c ^ B \K). 

Without any restrictions, it is known that there are infinite number of ways to decompose a blurry image B into the 
convolution of an image I and a blur kernel K Q. Interestingly, when the image I (or its edge image) is assumed 
to be very sharp (i.e., the smallest convolution eigenvalue is large), Theorem 12.11 suggests that the decomposition for 
given image B tends to be unique in a way that the allowable kernels should be very close to each other. The following 
corollary makes this statement more precise: We call an image I is r-sharp if cr^ in (7) > r with r = 2\f2a^ lax {B) / e 
for some small number e. Then we have: 

Corollary 2.1: Denote all possible r-sharp decompositions of a blurry image B as VL e B = {(K, I)\I®K = B,K 6 S}. 
For any two pairs (K' , Iq), (Kq, Iq) 6 fl £ B , we have that 



\K 



where e > is any small parameter. 

The above analysis are based on the assumption that there is no noise, i.e., B = I <E) Kq. Due to the fact the the blur 
in reality may not be uniform within an image, a more appropriate model is that B = Iq ® Kq + N, where N denotes 
unknown noise (errors). In this case, we have the following theorem to bound the estimate error. 

Theorem 2.2: Suppose B = I ® Kq + N, K e S, \\C(N)\\ F < e, and I ^ 0. For the kernel Kq estimated by 0, 
we have that 

K -K F 



(B) + ccond(B)^/siS2€ 



Mo) 



where ccond(B) 



= cr c 

max 



(B)/a^ nin (B) is the "convolution condition number" of B. 



C. Practical Blind Deconvolution via Alternating Minimization 

Theorem 12 . 21 illustrates that the estimation produced by (fT2l may not be so accurate with noise. Moreover, the estimated 
kernel by directly minimizing the regularizer is not always accurate enough even for noiseless cases (see Figure [3). To 
solve the overall blind deconvolution problem, we use the proposed regularizer h c ^ B \K) as an additional term to 
constrain the deconvolution problem. Namely, we jointly seek the sharp image Iq and the blur kernel Kq by minimizing 
the following objective function: 

min||B-I(g)iq! + A||W||i +ah ci - B \l<), s.t. KeS, 

3 One may have noticed that the &mi n (Io) shown in jlO\ (and from then on) actually refers to o"^, 1+mi _ 1 ^ S2+m2 _ 1 j (^o) other than cr^" 1S2 (^o). 
because the convolution operator can change the sizes. The number 0.15 is calculated with considering this detail. 
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where we have chosen /(/) = ||V/||i (i.e., total variation) and h(K) = h^ B '(K) in (|3j. 

Although the above problem is nonconvex, the initialization step is no long crucial due to the strong image-dependent 
kernel regularizer: Unlike the previous algorithms (e.g., [6], [10], |[l"3l , ifTTl , ifTSl ) which need to carefully chose the 
initial solution, we simply choose the observed blurry image B to be the initial condition for I. 

• While fixing the variable I, the blur kernel K is updated by solving minx \\B-I®K\\ F + ah c( ~ B \K), s.t. K e S, 
which is equal to the following quadratical programming: 

mm MB) -A mu „ l2 (lMK)\\ 2 + a(v(K)) T Hv(K), s.t. K £ S, 

where the Hessian matrix H is computed by dX3T >. and || ■ || is the ^2-norm of a vector. 

• While fixing the blur kernel K, the estimate of the image / is updated by 

min \\B-I®Kf F + A|| V7||i, 

which can be solved by any of the many non-blind deconvolution algorithms developed in the literature. In this 

paper, we simply use the fast method introduced by |4j. 
Unlike previous blind deblurring methods that need to carefully control the number of iterations, we run the iterations 
until convergence. Usually, our algorithm needs about 100 iterations to converge. 

On Choosing the Parameters. There are six parameters in total: The kernel sizes mi,m2, the sampling sizes si, S2, and 
the trade-off parameters a, A. Three of them, including mi,ro2 and a, need be set carefully. The kernel sizes mi,ro2 
are better to be a bit larger than the true blur size, which requires trying the algorithm several times to determine. It also 
needs some efforts to choose the parameter a. When a is too small, our algorithm will always converge to the no-blur 
explanation (i.e., I = B,K = S); when a is set to be very large, the recovered image is very sharp, but unnatural. 
Usually, there exists such a threshold a*: The optimal solution is always (I = B,K = S) while a < a*; the solution is 
often satisfactory while a is slightly larger than a* . 

After the kernel sizes being determined, we always set the sampling sizes as si — 1.5mi and S2 = 1.5m2. The 
parameter A has no significant influences. In our experiments, this parameter is chosen from the range of 0.001 to 0.002. 

Computational Cost. Due to the developments of fast non-blind deconvolution (e.g., JH), the procedure for updating the 
image variable / is already very fast. So we only analyze the costs for computing the Hessian matrix H and updating 
the kernel variable K. For simplicity, we assume that s\ = mi, S2 — m%. Then the complexity of computing the Hessian 
matrix H is 0(m\mfyri\n2 + m\m\), where ni,n,2 are image sizes. In each iteration, the Toeplitz matrix A mi ,m 2 {I) 
need be updated. So, the complexity of the blind deconvolution procedure is 0[n s {jn\Tn2n\n2 + TOfm^)), where n s is 
the number of iterations needed to converge. Overall, our current algorithm is not fast while deal with large kernels. To 
process a 300 x 300 image with 13x13 kernel, our algorithm needs about one minute. While dealing with a 500 x 500 
image with 31 x 31 kernel, the computational time increases to two hours. Nevertheless, it is possible to speed up the 
algorithm by fast Fourier transforms. We leave this as future work. 

III. Experiments 

A. Main Results 

We test with five examples (two synthetical, three real): The synthetical images are the convolution of 300 x 300 
natural images and 13 x 13 synthetical blur kernels; the real images are subregions (about 350 x 350) selected from large 
pictures captured by a NEX-5N camera (please refer to Figure [TJ. To show the advantages of the proposed method, we 
also test five state-of-the-art blind deconvolution algorithms, including irTOl . IfTTl . fl9l , IfTSl . and ff3l . 

Figure [4] shows the comparison results. To save space, we only show the best result of the previous algorithms for 
comparision. On the simple examples with easy blur kernels and tiny noise level (the first three examples in Figure 
|4j, our algorithm performs as well as the most effective baseline. While dealing with challenging cases (the last two 
examples in Figure [4j, where the blur kernels are complicated, it can be seen that our algorithm works distinctly better 
than the most competitive baseline. This is because that our regularizer h ct ^ B \K) contains strong and accurate priors 
about the blur kernels, whereas the kernel priors adopted by previous algorithms are too weak to handle such difficult 
deblurring tasks. In particular, the forth and fifth examples illustrate that it is possible for our algorithm to successfully 
handle some extremely difficult cases, where the blurry images are very unclear such that human eyes are unable to 
recognize their contents. 
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Fig. 4. Comparison results, using two synthetical and three real examples. Left: The observed blurry image. Middle: The best result among 
Fergus et al. 1191 , Shan et al. 1181 , Cho and Lee 1171 . Xu and Jia 1131 . and Cho et al. 1 10]. Right: Our result. For these five examples, the parameter 
a is set as 5050, 1050, 70, 4000, and 27000, respectively; the kernel sizes are, 13 X 13, 13 X 13, 17 X 17, 17 X 17, and 31 X 31, respectively. The 
parameters of the baselines are also manually tuned to best. 




Fig. 5. Some results obtained from the images with few edges. Left: The original sharp image. Middle: The blurred version of the sharp image. 
Right: The deblurred version produced by our algorithm. 



B. Diagnosis 

A basic assumption of our approach is that the edges of a sharp image are different before and after blurring. So our 
algorithm is less effective while the images contain fewer edges, as shown in Figure [5] Particularly, the third example 
of Figure [5] shows that the "true" blur kernel cannot be recovered if the original image has very few edges. Nevetheless, 
it is not very critical to consider such images as there is almost no difference between their sharp and blurry versions. 

Note that the forth example of Figure [4] may have non-uniform blur kernels, since the deblurred result favors the 
characters "0,P" over "S,T". This is true, as illustrated in Figure [6] While the blur is uniform within the whole image, 
the varying of a (within certain range) should not change the shape of the estimate of the blur kernel. 

IV. Conclusions and Discussions 

By studying how the spectrum of an image as a convolution operator changes before and after blurring, we have derived 
a convex regularizer on the blur kernel that depends on the given blurry image. For the blind deconvolution problem, we 
show that this convex regularizer is an effective prior that can deal with various types of realistic and challenging blurs 
(e.g., a combination of Gaussian and motion blurs). Both theoretical and experimental results have verified the validity 





Fig. 6. Non-uniform blur kernels. Left: The observed blurry image. Middle: The deblurred version produced by our algorithm with a = 1000. 
Right: The deblurred version produced by our algorithm with a = 4000. 



and effectiveness of the proposed prior. Notice that our regularizer harnesses certain necessary conditions on the blur 
kernel, but not sufficient. So it is entirely possible there might exist other, potentially more effective image-dependent 
regularizers for the blur kernel. 

Theorem 2.1 and the empirical results in Figure [3] even suggest that in theory, for certain class of sharp images, it 
might be possible to solve the blind deconvolution problem by avoiding the iterative minimization altogether. Namely, the 
problem might be solvable by using only two computational stable procedures: Firstly estimate the blur kernel without 
estimating the sharp image at all; and then recover the sharp image by performing non-blind deconvolution. However, 
we leave such investigations for future. 



from I(x, y) to I(x — u,y — v). 



Appendix 

Proof: Let T u ,v(') denote the transformation operator that shifts a 2D function /(• 
By the convexity of /(•), 

f{I<Z>K) = f(±2K(u,v)T u , v (I)) 

U.V 

= f(I), 

where the last equality is due to f(T u ,v(I)) = !{!)■ This is naturally satisfied, as T u .v(I) and / refer to the same image. 



Proof: Using the "min-max" half of the Courant-Fisher theorem, we have that 



mm 

y-f, 



max \\£(B)®X\\ F 

|| X || _p — 1 . 

j=l,"",i-l. (X,Y 3 )=0, 
3=1,- 

< min max \\C(B) ® X\\ p 
Y ]=K f(i), ||x||f=i, 

i=l,...,<-l. (X,Yj}=0, 
3=l,-,i-l. 

max \\C(B)®X\\ F . 

\\x\\r=i, 

j=l,-S-l. 



By (2), we have \\C(B) ® X\\ F < \\C(I) ® X\\ F and thus 

crf(B) < max 

l|X||jr = l, 

(X,«f(I)>=0, 
3=1,- ,i-l- 

< max 

\\x\\ F =x, 

(X,«f(J)>=0, 
3=1,- ,»-!• 

= of(/), 

where the last equality is due to the definition of the convolution eigenvalues. 
The proof of Theorem 2. 1 needs to use the following lemma. 



\£(B)®X\\ F 



\£(T)®X\ 
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Lemma A.l: If B ^ 0, then the Hessian matrix H defined by (13) is positive definite and 

h C{B) (X) > c SlS * \\X\\ 2 F ,VX € M miXm2 . 

{^maxK-B ) ) 

Proof: It could be calculated that 

h c {B)(x) S ^ > 1 v u K c ml 2 ^ nx)VHMx) 

where H x = J2i=i(-A mi ,m 2 (Ki(B))) T A mi , m2 (Ki(B)). The Toeplitz matrix A. mi ,m 2 ( K f (B)) is a linear operator of 
nf{B) and can be explicitly written as 

A mum2 ( K f(B)) = MKf(B))) T n i; (v( K f(B))) T n 2 ;---}, 

where {f2 3 }^.™J +Sl_1 ^ m2+;S2_1 ^ are a set of binary matrices and satisfy J2j(^j) T ^j ~ s i s 2^, and I is the mi x m 2 
identity matrix. Hence, 

sis 2 (mi+si-l)(ro 2 +S2-l) 

Hi = £ E (^) T K K f(B))(K^f(i?))) T % 

i=l i=l 

(mi+si-l)(m 2 +S2-l) sis 2 
(m 1 +s 1 -l)(m 2 +s 2 -l) 

E (%) T % 

= sis 2 2, 

which simply leads to h c( - B ^(X) > s lS2 \\X\\ 2 F /(a^ ax (B)) 2 . ■ 
Proo/: of Theorem 2.1 By (11) and the convexity of h c ^(-), 

h c(B \K -k ) < h^ B \K ) + h« B \K ) 

< 2h c{B \Ku) 
2s 1 s 2 



< 



By Lemma C.l, we also have 



h c W(K K ) > / lS * \\K Ko\\%. 



Hence, \\K - K \\} < 2(ai ax (B)) 2 /(*f nm (I )) 2 . 
Proof: Since Kq is deterministic, we have 



\\K' -KX\\ F = \\(K' -K ) + (k -KZ)\\ F 

< \\K' -Ko\\f + \\K -K%\\ f 

a min\ 1 0J a min\ 1 ) 

e e 

< - + -=£. 

- 2 2 

■ 

We first prove the following lemma. 

Lemma A.2: For any two matrices X and Y, we have 

\\x®y\\ f < ||x|| F ||y||i, 

where || • || is the £i-norm of a matrix. 

Proof: Decompose Y into the sum of its negative part (denoted as Y~) and nonnegative part (denoted as Y + ). By 
(2) and the convexity of matrix norms, 

\\X®Y\\ F = \\X®Y+ + X®Y-\\ F 

< \\X(g>Y + \\ F + \\X®Y~\\ F 

< wxWfWy+Wj. + ||A-|| F ||y-||i 
= ll^llFi|y||i. 
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Proof: Of Theorem 2.2 In noisy case, the inequality (10) need be changed to 

af(B) + \\C(N)® K f(B)\\ F 



\\K (g> K f(B)\\ F < 
< 
< 

Thus the inequality (11) becomes 



aHB)+44{B)\\i 
ofCg) + y^ie 



< 



(^inUo)) 2 

Then by following the proof procedure of Theorem 2.1, this theorem can be proven. 
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